Mandarin Chinese Prosodic Phrase Grouping and Modeling—Method and Implications

نویسنده

  • Chiu-yu Tseng
چکیده

One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping [1, 2, and 3]. Phrasal and sentential intonations are governed by a prosody framework that structurally group phrases into a larger/longer and identifiable unit. An overall prosody pattern of such phrase grouping with prosodic specifications is superimposed on phrase group. In turn, individual phrasal intonation under prosody grouping has to adjust in accordance with structural specification from the prosody framework. The output is then seen as derived outcome. The aim of the present paper is to experiment how to simulate prosodic phrase grouping using the Fujisaki intonation model that originally specifies only phrasal or sentential intonations, and how such an intonation model can be further enhanced by incorporating prosodic specifications such as boundaries and breaks, prosody levels/layers and phrase positions under the notion of phrase grouping. The experiments began with aligning the phrase command of the intonation model to boundaries of breaks in the speech flow, then examining prosodic characteristics such as relative position of the target phrase within a prosodic phrase group. Finally, using a linear regression model to predict prosody output from prosodic words upward, predictions of an overall pattern for prosodic phrase grouping was derived. The pattern matched with a prosody base form aimed at prosodic phrase grouping; it also accounted for how and why phrasal intonations were modified in relation to prosody organization. Hence, phrasal intonation is seen as components of prosodic phrase grouping.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prosodic Fillers and Discourse Markers–Discourse Prosody and Text Prediction

Mandarin Chinese fluent speech prosody is characterized by a hierarchical multiple-phrase structure that specifies how speech paragraphs are constituted via Prosodic Phrase Grouping. Hence we view spoken discourse prosody as yet another higher node treats PGs (Prosodic Phrase Groups) as sister constituents. The goals of present study are two fold: one is to study how speech paragraphs are conne...

متن کامل

Fluent speech prosody: Framework and modeling

The prosody of fluent connected speech is much more complicated than concatenating individual sentence intonations into strings. Prosody framework and modeling should base on more understanding of both the production and perception of fluent speech. We analyzed speech corpora of read Mandarin Chinese discourses from a top-down perspective on perceived units and boundaries, and consistently iden...

متن کامل

Prosodic Word Grouping in Mandarin TTS System

This paper reports the methodology and results of prosodic word grouping for a Mandarin TTS system developed by the Fujitsu Laboratories. In view of any inner prosodic word break will make speech unintelligible or unnatural, a new prosodic word grouping framework is proposed. The word segmentation result can be regarded as an initial prosodic word sequence with grids inserted into each word bou...

متن کامل

Prosodic grouping in Chinese trisyllabic structures by multiple cues – tone coarticulation, tone sandhi and consonant lenition

Traditionally, the prosodic domain as has been called ‘foot’ in Mandarin Chinese is considered to be derivable from the application of Tone 3 sandhi rule. This study investigated the internal prosodic grouping of Chinese trisyllabic structures by examining multiple cues in parallel – tone coarticulation, tone sandhi application and consonant lenition. Analyses by tone coarticulation and consona...

متن کامل

Production and Perception of Tone 3 Focus in Mandarin Chinese

This study uses production and perception experiments to explore tone 3 focus in Mandarin Chinese. Overall, contrastive focus in Mandarin is clearly marked with increased duration, intensity, and pitch range: in the experiments, listeners identified focused syllables correctly more than 90% of the time. However, a tone 3 syllable offers a smaller capacity for pitch range expansion under focus, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004